Adaptive Critic Based Adaptation of A Fuzzy Policy Manager for A Logistic System
نویسندگان
چکیده
We show that a reinforcement learning method, adaptive critic based approximate dynamic programming, can be used to create fuzzy policy managers for adaptive control of a logistic system. Two different architectures are used for the policy manager, a feed forward neural network, and a fuzzy rule base. For both architectures, policy managers are trained that outperform LP and GA derived fixed policies in stochastic and non-stationary demand environments. In all cases the fuzzy system initialized with expert information outperforms the neural network. Index terms -applications, neural networks, reinforcement learning, genetic algorithms, qualitative reasoning, rule learning
منابع مشابه
Adaptive critic based approximate dynamic programming: A new tool for smart manufacturing
This work supported in part by the National Science Foundation under grant ECS-9904378. Abstract Adaptive critic based approximate dynamic programming techniques are gradient based methods for finding optimal policies for multi-stage decision processes. We believe adaptive critic methods are now developed to the point that they can be applied to the full spectrum of decision and control problem...
متن کاملIS-MRAS With On-Line Adaptation Parameters Based on Type-2 Fuzzy LOGIC for Sensorless Control of IM
This paper suggests novel sensorless speed estimation for an induction motor (IM) based on a stator current model reference adaptive system (IS-MRAS) scheme. The IS-MRAS scheme uses the error between the reference and estimated stator current vectors and the rotor speed. Observing rotor flux and the speed estimating using the conventional MRAS technique is confronted with certain problems relat...
متن کاملIndirect Adaptive Interval Type-2 Fuzzy PI Sliding Mode Control for a Class of Uncertain Nonlinear Systems
Controller design remains an elusive and challenging problem foruncertain nonlinear dynamics. Interval type-2 fuzzy logic systems (IT2FLS) incomparison with type-1 fuzzy logic systems claim to effectively handle systemuncertainties especially in the presence of disturbances and noises, but lack aformal mechanism to guarantee performance. In contrast, adaptive sliding modecontrol (ASMC) provides...
متن کاملDesign and implementation of an adaptive critic-based neuro-fuzzy controller on an unmanned bicycle
Abstract: Fuzzy critic-based learning forms a reinforcement learning method based on dynamic programming. In this paper, an adaptive critic-based neuro-fuzzy system is presented for an unmanned bicycle. The only information available for the critic agent is the system feedback which is interpreted as the last action performed by the controller in the previous state. The signal produced by the c...
متن کاملVoting Algorithm Based on Adaptive Neuro Fuzzy Inference System for Fault Tolerant Systems
some applications are critical and must designed Fault Tolerant System. Usually Voting Algorithm is one of the principle elements of a Fault Tolerant System. Two kinds of voting algorithm are used in most applications, they are majority voting algorithm and weighted average algorithm these algorithms have some problems. Majority confronts with the problem of threshold limits and voter of weight...
متن کامل